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E BEGIN with the observation that a 
WV person is constantly classifying the 
contents of the world about him. The child, 
for instance, is taught what kinds of stimu 
lation may be called green and what ones 
blue. Later he may likewise learn what 
modern Italian paintings are futurist in style 
rather than surrealist, or what people should 
be called communists. There seem to be 
certain common features involved in all such 
classifying, and they may be put as follows. 
Stimulation is received which the individual 
arrays along one or more innate or learned 
dimensions. Such input, along wider or nar 
rower extents of such dimensions, is asso 
ciated by repeated contiguity with a class 
name. It is in such a manner as this that 
the individual comes to learn what parts of 
the world of stimulation it is appropriate to 
call of yellow color, politically anarchisti 
or impressionistic in style of painting 
But the learning situation is often rife 
with elements of ambiguity. Was that color 
blue or green? Its hue was such that one 
cannot be sure. Here is a painting whose 
stvle seems to reflect both futurist and sur 


1 This monograph is based on a dissertation sub 
mitted in partial fulfillment of the requirements for 
the degree of Doctor of Philosophy at Harvard 
University. The author is greatly indebted to 
Jerome S. Bruner, his principal adviser, for en 
couragement and aid in this research. He also 
wishes to thank Gordon W. Allport, John B, Car 
roll, George Mandler, and Evon Z. Vogt, the other 
members of his Thesis Committee, and Albert ] 
Caron, Paul Friedrich, and Frederick Mosteller, all 
of whom rendered assistance in various ways 
Special thanks are due Rainer Weiss for design of 
the electronic apparatus. This research was con 
lucted while the author held a‘ doctoral research 
fellowship from the Samuel S. Fels Foundation 
Support was also provided by Grant M-1324 from 
the United States Public Health Service, under the 
general direction of Jerome S. Bruner 
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realist components. Are this man’s activities 
really of a sort that justifies calling him a 
communist? With our world of fleeting 
stimuli, of quick looks, and conflicting 
opinions as to what class labels are appro 
priate in what situations, such ambiguities 
as these would seem to be common indeed. 


If one is to study how people learn to clas 
sify, then, it would seem necessary from the 
outset to recognize that the world more 
often than not provides a person with stimu 
lation that does not clearly fit the definition 
of any one class or label, but rather stands 
at the boundary of a particular label’s appli- 
cability ; it can be labeled with equal or near 
equal appropriateness by more than one 
term. Thus we come to the central question 
of our inquiry: Do any factors arising from 
the nature of classification itself influence 
how it proceeds in an ambiguous situation 
ie., a situation where alternative class 
names for the given event have been learned 
to equal degree ? 

There are at least two mechanisms arising 
from the way classes are defined which in 
fluence the tendency to assign information 
to a particular class: (a) whether a class is 
defined by inclusion or exclusion and (>) 
whether a class defined by inclusion is con 
trasted with a class defined by exclusion of 
the former class or with one also defined by 
inclusion. Consider the first question 
[here seem to be two main ways in which 
one can specify the contents of a class, and 
our hypothesis is that their psychological 
implications are quite different. On the one 
hand, the contents of a class may be speci 
fied by inclusion; such-and-such attribute 
values must be present for the datum to be 
given a particular class label. If, for in 
stance, something has wings, two legs, and 


fies through the air, we may appropriately 


| 
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classify it as a bird. On the other hand, the 
contents of a class may be specified by ex- 
clusion; if such-and-such attribute values 
are not present, the datum can be called a 
nonexemplar of the class or classes appro- 
priate to those attribute values. If, for ex 
ample, an animal does not spend all his time 
on land and/or in the water, does not have 
more than two legs, does not lack wings, 
then we may appropriately classify it as an 
animal residing somewhere other than on 
land or sea—and we will be referring here 
to the same domain of instances that we 
labeled before as “birds.”’ 

Given an inclusion and an exclusion class 
that both refer to the same domain of in 
stances, our hypothesis is that the range of 
acceptance limits for placing data in a class 
will be greater for the former than for the 
latter. An inclusion class, we 
carries with it the implication that it is well 
represented in the environment—that there 
are many events for which the class label in 
question has clear applicability. An exclu- 
sion class, on the other hand, is a more 
vague entity. It is as if people prefer ex 
amples of what something is, more than 
examples of what something is not. This in 
turn might be expected on the ground, for 
instance, that it is usually difficult to know 
how to deal with an event placed in an ex 
clusion class since the event could be any of 
a variety of things, or that work with an 
event defined by exclusion usually requires 
the added cognitive effort of transformation 
to a definition by inclusion. 

Our second suggested classification deter 
minant of range of acceptance limits is 
whether an inclusion-defined class of inter 
est is being contrasted with a class also de 
fined by inclusion or with the contrast class 
defined by exclusion of that class of inter 
est. We would propose that reference to 
some contrast class is always present, im 
plicitly if not explicitly, when we classify, 
and this for the following reason. In orde1 
to avoid being overwhelmed by the confus 
ing richness of attributes that the environ 
ment offers the senses, a person must define 
for himself in terms of a limited 
selection of properties. Consideration of a 
contrast class, in turn, usually provides on: 


suggest, 


a class 
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mechanism for such selection. If we make 
setters the contrast when defining the class 
“cocker spaniels,” for instance, then we may 
tend to take size as criterial for our defini- 
tion of the latter class. If, however, we take 
“all animals other than cockers” as the im- 
plied contrast, then we may well define 
cockers differently. Thus a contrast class 
too may be defined by inclusion or exclu- 
sion, and we hypothesize that this difference 
is a second determinant of the range of ac- 
ceptance limits for placing events in an in- 
clusion class of interest. If the contrast 
class is defined by exclusion of the class of 
interest, we predict that the acceptance 
limits for placing events in that inclusion 
class of interest will be broader than if the 
contrast another inclusion class. 
This should follow because we expect a con 
trast class defined by exclusion of a class to 
be relied on to a lesser extent than an 
inclusion-defined contrast class, hence giving 
the class of interest more opportunity in the 
former case to capture events away from 
the contrast class. 


class is 


VARIABLES AND HypoTHEsEsS UNDER STUDY 


Dependent Variable 

Range of acceptance limits for placing 
events in a class. The range of acceptance 
limits for placing events in a class is said to 
be greater, the greater the number of dis- 
criminably different events assigned to that 
class, and the higher the relative frequency 
of assignment of each such event to one 
class rather than to another class or other 
classes (or the greater the magnitude of 
some response indicating assignment to a 
class) 


Independent Variables 

Definition of a class of interest by in 
clusion or exclusion (positive or negative 
A domain of data is said to be de- 
fined by inclusion, or assigned to a positive 
class, if the definition of the class specifies 
the particular attribute values whose pres- 
ence is definitive of instances of that class. 
\ domain of data is said to be defined by 
exclusion, or assigned to a negative class, if 
the definition of the class specifies simply 


‘lass ) 


that the attribute values definitive of in- 
stances of some other class or classes are 
absent—hence, that instances of some posi- 
tive class or classes are absent. 

Presence of a positive or complementary 
negative contrast class for a positive class of 
interest. The class considered as a contrast 
to a positive class of interest may be the 
negative class complementary to that posi- 
tive class of interest (if the positive class of 
interest is called “X,’”’ the complementary 
negative contrast class is “X”), or the con- 
trast class may be some other positively de- 
fined class. 


Hypothesized Relationships Between 
Inde pendent Variables and the 
Dependent Variable 


Hypothesis 1: Concerning the influence 
of positive vs. negative classes on the range 
of acceptance limits. (a) For a class whose 
definition is never more than ambiguous, a 
positive class has a greater range of accept- 
ance limits for placing events in it than does 
a negative class associated with the same 
domain of events. (b) The same is true for 
a class whose definition is at least in part 
certain. 

Hypothesis 2: Concerning the influence of 
a contrast class, and how it is defined, on 
the range of acceptance limits for the class 
of interest. (a) Fora positive class whose 
definition is never more than ambiguous, 
the range of acceptance limits for placing 
events in that class will be greater when 
contrasted with its complementary negative 
class than when contrasted with another 
positive class. (b) The same is true for a 
positive class of interest whose definition is 
at least in part certain. 

It should be noted that we expect these 
relationships only to hold with regard to 
data that are generally ambiguous. [Data 
are maximally ambiguous if their relative 
frequencies of assignment to alternative 
classes are the same. Data are maximally 
certain, on the other hand, if their relative 
frequency of assignment to one class is 
100%. We are interested in the determi 
nants of S’s classification of generally am- 
biguous data because such ambiguity is the 
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case in those life situations for which the 
present hypotheses are supposed to be para 
digmatic. 


Now a class may be ambiguously asso- 
ciated with some data and associated more 
certainly with other data, or it may be only 
ambiguously associated with data. The 
question thus arises of whether the effects 
we postulate will hold only with regard to a 
class which is itself never more than am 
biguous, or rather will also hold regarding 
a class whose definition is at least in part 
certain. Hence Parts a and b of each hy 
pothesis. A class is maximally ambiguous, 
then, if no datum has more than an equal 
relative frequency of assignment to that 
class and to one or more other classes, and 


the greater the number of discriminable 
data of this kind. On the other hand, a 
class becomes increasingly certain, the 


greater the relative frequency of assignment 
of data to that class rather than to another 
class or classes, and the greater the number 
of discriminable data with a high relative 
frequency of assignment of this kind 


Design of Conditions 


We turn now to the matter of testing the 
hypotheses presented. The experimental re- 
quirements are these: (a) We must vary 
the way a class of interest is defined 
tively vs. negatively 


posi- 
meanwhile holding the 
contrast class constant. The logical impos- 
sibility of a class of interest and a contrast 
class both being negative may be noted.’ 
Hence, it follows that the contrast 
held here must be 
while interest varies from 
(b) We must also 
vary the way the contrast class is defined, 
meanwhile holding the type of definition of 


class 
constant 
the 


positive to negative. 


positive, 


class of 


the class of interest constant. Since we wish 
to compare the case where the contrast class 


7 It would be logically impossible to have both 


class of interest and contrast class negative, because 
two negative classes cannot apply to a 
when the two cl 
cannot both apply 
time) and are all-inclusive 


domain 
exclusive (1.e., 
same event at the same 


mutually 


isses are 


to the 
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is positively defined with the case where it 
is defined by exclusion of the class of inter- 
est, it follows that the class of interest held 
constant must be defined positively—again 
to avoid the logically impossible case of 
contrast class and class of interest both be 
ing negative. Further, we want situations 
where the classes whose acceptance limits 
we are determining are never more than 
ambiguous in definition, and other situations 
where these classes’ definitions are in part 
certain. 

Let us refer to a class defined positively 
as “X” or “Y,” and to one defined nega- 
tively as “X” or “Y.” Three conditions, in- 
volving association of these classes with a 
range of data in different ways, will permit 
us to test the above hypotheses. These con 
ditions are as follows: 

The XY condition. Both classes associ 
ated with the domain of data are positively 
defined. The association of Class X with 
that domain is in part certain, while the 
association of Class Y with that domain is 
never more than ambiguous. 

The XX condition. A positive class and 
its complementary negative class are asso 
ciated with the domain of data. The asso 
ciation of the positive class (X) with that 
domain is in part certain, while the associa 
tion of the negative class (X) with that 
domain is never more than ambiguous 

The YY condition. Again, a positive class 
and its complementary negative class are 
associated with the data domain. This time, 
however, the association of the positive 
class (Y’ with that domain is never more 
than ambiguous, while the association of its 
complementary negative (Y) is in part 
certain, 

How do these conditions permit us to test 
our hypotheses ? Consider Hypothesis la 
Tf we compare Conditions XY and XX, we 
are holding a contrast class constant and 
positive (X), and varying the definition of 
an ambiguous class of interest from positive 
(Y) to negative (X%)—which is what must 
lor Hy 


pothesis 1h, if we compare conditions XY 


be done to test that hypothesis. 


and YY, we are holding a contrast class 
constant and positive (Y), and are varving 
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the definition of a class of interest which 
is in part certain from positive (X) to 
negative (Y)—the requirement for testing 
that, hypothesis. lor Hypothesis 2a, we 
again compare Conditions XY and YY, 
since here the ambiguous positive class of 
interest is Y, and we compare a positively 
defined contrast class (X) and a comple- 
mentary negative contrast class (¥). Fi- 
nally, in the case of Hypothesis 2b, we con 
sider Conditions XY and XX again, for 
here the positive and, in part, certain class 
of interest is X, and we compare a posi- 
tively defined contrast class (Y) and a com- 
plementary negative contrast class (X). 
Note that the groups being compared are 
the same for Hypotheses la and 25, and for 
Hypotheses 1b and 2a. How, then, can each 
hypothesis in the pair be tested separately if 
the same conditions are compared for the 
two hypotheses? The answer lies in con- 
the measures we use to de- 
t range of acceptance limits. One 
kind of measure involves S’s predicting, for 


sideration of 


ermine 


each successively presented stimulus event, 
whether it will be given one or the other 
label, or refusing to predict. 
\nother kind of measure, after S has ex- 
perienced the whole set of associations be 
tween various stimulus events and the two 
class labels, involves S’s reporting, for each 


class his 


successively presented datum, whether it 
Was given one or the other class label more 
than half the time before, or was given each 
label about half the time before. Provision 
of the “refusal to predict” response choice 
in the first kind of measure, and of the 
“half half” choice in the 
second kind of measure, permit separate 


response 


testing of each hypothesis in the pairs just 
noted, even though both in the pair are 
tested by comparing the same two condi 
tions. This comes about because these re 
sponse choices allow the relative frequency 
of assignment of data to Class X to vary 
independently of the relative frequency of 
assignment of data to Class Y for the XY 
condition ; likewise, the relative frequencies 
of data assignment to Classes X and X can 
vary independently in the XX condition, 
and those to Y and Y can vary independ- 


4 


ently in the YY condition. These fre- 
quencies are independent since there is al- 
ways a third choice which S can make in 
any of the groups: he can refuse to offer a 
prediction or can say each label occurred 
about half the time before. (Let us abbre- 
viate responses of the latter two sorts as 
“OQ” judgments.) Calling the two class 
labels in each case (i.e., X vs. Y, X vs. X, 
Y vs. Y) Labels 1 and 2, respectively, we 
hence see that assignment of input to Label 
1 can change in relative frequency without 
a reciprocal change in relative frequency of 
assignment of input to Label 2 (or vice 
versa), because there is a “cushion” in 
between: namely, “O” judgments. 

Consider now what each of the hypoth- 
eses predicts concerning these relative fre- 
quencies. According to Hypothesis la, com- 
paring XY and XX, the relative frequency 
ot assignment of data to Class Y should be 
greater than that to Class X. By Hypoth- 
esis 1b, comparing XY and YY conditions, 
the relative frequency of assignment of data 
to Class X should be greater than that to 
Class Y. By Hypothesis 2a, again com- 
paring XY and YY conditions, the relative 
frequency of assignment of data to Class \ 
should be greater for the YY group, and 
less for the NY group. Finally, by Hypoth 
esis 2h, again comparing the XY and XX 
groups, the relative frequency of assign 
ment of data to Class X should be greater in 
the XX group, and less in the XY group 
It becomes clear, then, that the four hypoth 
eses can be independently tested if, as is the 
case, the relative frequencies of data assign 
ment to Classes X vs. Y, X vs. X, and Y 
vs. Y, are independent. 


Experimental Procedure 


Light intensity was used as the dimension of 
stimulus input, since it was a simple one and 
changes in its values could be easily created and 
measured. The S’s training regimen involved asso 
ciating brightnesses with labels or class names 

The questions of defining a class positively vs 
negatively, and providing a contrast class defined 
positively or defined by exclusion of the positive 
class of interest, were handled by varying the kinds 
of class 


labels provided. For Condition XY, 
where both labels were to be positive, when a 
brightness was to be linked with Label 1, a bulb 
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lit up the letter “X” 
to be linked with 
the letter “Y"—the “X” 


one on each side of the 


; and when a brightness was 
Label 2, a different bulb lit up 
and “Y” bulbs being placed 
screen where brightnesses 
were presented. When there are separate “X"" and 
“Y" bulbs and one of them 
thereby imply to S that there 
defined by particular vi 
bute 


lights each time, we 
are two classes each 
ilues of the brig 

1.€., two positive classes 
For Conditions XX and YY where, on the other 
hand, one label was to be positive and the 


tness attri 


other 
its complementary negative, when a brightness was 
linked with the label, a bulb lit: 
while when a brightness was to be linked with the 


it same bulb failed to light—the 


to be positive 


negative label, 


single bulb being placed underneath the = screen 
where brightnesses were presented, and no other 
bulb being present. When there is just one bulb 
which either lights or fails to light each time, we 
thereby imply to S that there is one class defined 
by particular brightness values and another class 
defined simply by absence ot those val ics 1.¢ i 
positive class and a complementary negative class 
respectively 

Note that the actual stimulus inputs of bright 
nesses and the relative frequencies of particul: 
associations of rig esses wit one ot two pos 
sible class labels, were alike in conditi What 
varied were the instructions provided by the light 
labels as to how the sses were defined, while the 
structures of the three pairs classes were exactly 
the same 

The numbe ent es used 
was 16, these be space " ile of just 
noticeable differences (see able 1) The total 
number of presentations of brightnesses during the 
Ww series W 160, each brightne coming up 

ies and the whole sequence | g uniquely 
random, but every brightness oc« ng twice m 
block of 32 presentations The relative fre 
The IND's were established by trials of the 


following sort: A rheostat was slow] 
ward from minimal brigl 
ss), brightnes 
ing recorded every time § 
able difference 


turned up 
wnward from 
level be 


reported a just 


tness (or d 
maximal brightn« 


notice 
Two upward series and two down 


ward series were given in single alternation to any 


one S. Sixteen was about the maximal number of 
IND’s detectable within the brightness range avail 
able from the rheostat. The E ran this procedure 
on himself and also two college student Ss 


Final brightness intensity settings were arrived at 
by taking all series in whicl 


1ad been provide 1 by the 


16 brightness values 
IND’s) 
and finding the median of the successive intensity 
settings for each of the 16 values 
ness values were provided i 


judge (i.e. 15 


Sixteen bright 
three of the four 
one judge, and in two of the 
by each of the 
different 


series by i four seri 
two other judges. (Fourteen or 15 
values were 


provided in the remaining 


series. ) 
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TABLE 1 


BRIGHTNESS LEVELS UsED 


Brightness Intensity 


(In DC ma. using a srightness No. 
60-w. bulb) 
1.00 (Brightest) 1 
.98 2 
.96 3 
88 
. 84 5 
80 6 
76 7 
72 8 
68 9 
63 10 
57 11 
53 12 
.48 13 
43 14 
.40 15 
.36 (Darkest) 16 


quencies of association of each brightness with 
each of the two labels used in any one condition 
are shown in Fig. 1. As indicated on that figure, 
toward the brighter end of the brightness dimen- 
sion all presentations were always associated with 
one label, while toward the darker end, all presenta- 
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two labels. The sequence of occurrence of the two 
labels in any condition was uniquely random, and 
was the same for the label of the same number (1 
or 2) in all the conditions. 

Seventy persons participated in the study, the 
number of university student Ss in Groups XY and 
XX being 23 each, while Group YY contained 24 
Ss. Procedure was the same ior all three groups 
except for the labeling differences among these 
groups as discussed above. This procedure fell into 
three parts and was arrived at as a result oi pilot 
investigation 

Training series. On each of the 160 trials, one 
of the 16 brightnesses was flashed for 0.5 sec. in a 
uniquely random order. After each brightness was 
presented, S pushed one of three buttons, indicat- 
ing, if in Group XY, that he bets the “X”’ light 
will go on, the “Y” light will go on, or that he 
refrains from making a bet, respectively; if in 
Groups XX or YY, that he bets the single light 
will go on (“Yes”), the single light won't go on 
(“No”), or that he refrains from making a bet, 
respectively. When S pushed any button, the “X” 
or “Y” light went on for 1.0 sec. if in Group XY, 
or the single light went on for 1.0 sec. or remained 
off if in Groups XX or YY—which of the two 
class labels occurred depending on the brightness 
that occurred on that trial. After a pause of 5.5 
sec. following a button push, the next brightness in 
the sequence flashed, thereby beginning the next 
trial. Thus, in the course of the 160 training trials, 
each brightness was linked with one or both labels 
according to the pattern of association diagrammed 


tions were equally often associated with each of the — in Fig. 1 
10 Oo 0 0 1 85 5.8 & 
ww 
FIO 
VAY At 
z VAVZ 
z SF 
VAVAVA LABEL 2 
/ 
VAVAVAVA Y FOR GROUP X - Y 
VAVAVA VALS} X FOR GROUP x - 
5a Y FOR GROUP Y - Y 
Or VAAVA A 
ne VAVAVAVAVAYW 64 
AY Af 4 
“3 
A A 4 A a 
TRIM VAL AVAYA 
Dw 4, 
VAL LABEL 1 J VAVAVAVAY, 
X FOR GROUP X-Y 4% 
© X FOR GROUP X-X , IWWAVAVAL 
WwW 2 V/AV/ 
4 A A AV/AV/AAWY 
2a AW. FOR GROUP Y-Y 4 
at WAY AVAY 
2 8 9 10 i2 13 66 
DARKEST 
BRIGHTNESS VALUES 
Fic. 1, Degrees of association of each brightness with Labels 1 and 2 


Test with brightnesses. On each of 32 trials, 
one of the 16 brightnesses was flashed for 0.5 sec. 
in a uniquely random order, each brightness oc- 
curring twice during the series. After each bright- 
ness was presented, S, if in Group XY, indicated, 
by pushing a button labeled “More,” that he thinks 
this brightness turned out to be X more than half 
of the times it was previously presented or turned 
out to be Y more than half those times, and then 
told E verbally whether he meant X or Y more 
than half the time before; or § might push another 
button labeled “Equal,” to indicate that he thinks 
this brightness turned out to be X about half those 
times and Y about half those times. If in Groups 
XX or YY, S indicated, by pushing the button 
labeled “More,” that he thinks this brightness 
caused the light to go on more than half of the 
times it was previously presented or caused the 
light to stay off more than half of those times, and 
then told E verbally which he meant; or S might 
push the other button labeled “Equal” to indicate 
that he feels this brightness caused the light to go 
on about half those times and to stay off the other 
half. No label lights went on during this test 
series. Five and one-half seconds after S pushed 
a button, the next brightness in the sequence 
flashed, thus starting the next trial.* 


4 Use of the two buttons noted in this test with- 
out brightnesses, rather than providing three but- 
tons for each of the three alternative decisions, 
was instituted as an added control to insure that S$ 
was pushed toward veridicality of report rather 


than away from it. A single button labeled “More” 
and another one labeled “Equal” imply to S that 
there might well be some brightnesses linked to 
one label more than half the time, and about as 
many other brightnesses linked to each label equal 
proportions of the time—which is in fact the case. 
The implication tendered, in other words, is that 
there is warrant to press the “Equal” button on 
about half of the test trials. To follow this impli- 
cation would lead S toward veridicality. If non- 
veridicality is obtained, then, in that Label 2 is 
thought to apply more often than was the case, this 
tendency was not fostered by the mere physical 
conditions of the test itself. 

The instructions given the various groups again 
insured that S was pushed toward veridicality of 
report rather than away from it, in that the ex- 
amples given of kinds of label-input associations 
told only of Label 1 being linked with a brightness 
shade all the times that shade is presented, or most 
of the times it is presented, or about half the times 
No example was mentioned where Label 2 was said 
to be linked with a brightness shade more than 
half the time, akhough of course this possibility 
was not ruled out. Once again, if nonveridicality 
is obtained in that Label 2 is thought to apply 
more than half the time to some brightness, then 
this decision was not fostered by bias in the com- 
municated instructions. 


CLASSIFICATION REQUIREMENTS ON GRADIENTS 0} 


RESPONSE 


/ 


Test without brightnesses. The S was shown a 
sheet of paper along which a horizontal line was 
drawn to represent the dimension of brightnesses 
to which he had been exposed, the word “Darkest” 
appearing at one end of the line and “Brightest” at 
the other.5 Along a vertical line at the left, divided 
into 10% steps ranging from 0% to 100%, appeared 
the legend “Proportion of times you think each 
brightness was labeled ‘X’” if for Group XY; 
while the legend for Groups XX and YY read 
“Proportion of times you think each brightness 
made the light go on.” The S was requested to 
draw a gradient indicating the proportion of times 
he thinks each brightness was labeled “X” (Group 
XY), or made the light go on (Groups XX and 
YY), for all parts of the brightness dimension. 

The choice measure in the training series indi- 
cated the learning of tendencies. The 
choice measure in the test with brightnesses pro 
vided a more perceptually based indication of the 
labeling tendencies thus learned (since the bright- 
nesses were presented 


labeling 


again), while the graph 
measure in the test without brightnesses provided 
a more conceptually based index of the same thing 
(since the brightnesses were not presented), La- 
tency measures in the training series and the test 
with brigiitnesses told 


us how confident S was in 


his labeling decisions 


1pparatus 


The time lengths for presentation of the bright- 
nesses and of the label lights, and for the interval 
between S’s pushing of a button and the occur- 
rence of the next brightness in the sequence, were 
automatically controlled by timers. Pre-established 
programs on stepping switches automatically con- 
trolled the sequences in which the brightnesses 
occurred and in which the two labels in any condi- 
tion occurred. The brightness differences were dif- 
ferences in the voltage settings of 16 rheostats, 
some one of which controlled the lighting of a 
common incandescent bulb for each trial. This 
high degree of automatic control of the sequence 


and timing of stimulus and label materials was 
required in order to free E to record the various 


items of data that S provided, while still maintain- 
ing strict temporal control over presentation of 
stimulus input and label materials 

The sequence of events in time is shown in Fig 
2, and the block diagram of Fig. 3 indicates the 
various apparatus components and how they were 
linked so as to result in this sequence. The push- 
buttons for the test series were mounted on one 
portable board while those for the training series 
were mounted on another, and one set of push- 
buttons was placed out of S’s sight when he was 


° For half the Ss in each group, “Darkest” ap- 
peared at the left and “Brightest” at the right end 
of the line, while for the other half these word 
positions were reversed 
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matically. When the end of the series was reached 
{0.5 sec. and E wished to set up the sequence for the next 
= - part of the experiment or for the next S, E re- 
turned the program sequences to their starting 
— — position by pressing the reset button, which homes 
all the stepping switches. The timer utilized was a 
Standard Electric Time Co. product, Type S-1 
| Neon | Delay A regular 60-w. lamp of white color was the 


gram sequence (start). = 


First brightness off. || Reaction time 
Timer on. |Jof S. 


S presses button. 
Timer off. 
| 


Appropriate neon indicator 
i | |delay, \ II, brightness intensity bulb, and it was mounted be- 
Particular pay-off light on (1.0 (1.0 Delay _ hind a circle of translucent blue celluloid 1.5 in. in 
or off as in program S€- || sec. | sec. Ill. | diameter, so brightness differences were various 
— depending on the | hse intensities of blue light. Pay-off lights were 3-w 
eevee grep and on train- us lamps mounted in pilot light fixtures. 
ing or test condition. |) ve The experiment was run in semidarkness, room 
Neon indicator light off. | lights being off and the only source of illumination 
Pay-off light off if it went other than the brightness intensities and pay-off 
on before. lights being a small 25-w. reading lamp placed well 
Stepping switch advances. | ___ behind a black curtain on E’s desk. The S sat 
Second brightness on in pro- 3 ft. from the light panel and at eye level with it 
gram sequence. Figure 4 shows a sketch of the apparatus from 
. etc. S’s point of view 
RESULTS 


Itc. 2. Sequence of events in time 
Tables 2~6 summarize the results obtained 
using the other, Neon indicator lights were Sing the measures outlined above. It will 
mounted on E's panel to tell him which button be noticed that when results are presented 


S pushed on that trial. The neon delays resulted concerning various parts of the whole 
in a neon light’s staying on for 1 sec. after S re- 
leased the button, so E was sure to see which light 


went on. The & began the brightness sequence in ® This semidarkness was maintained during the 
the training or test series by pressing the start training series, during the instructions for the test 
button, and from then on the series proceeded auto- with brightnesses, and during this latter test itself 


PAY-OFF LIGHT 1 1S IN ON POSITION PAY-OFF LABELS FOR S 17 
DURING TRAINING OF GROUPS ANDO x-y 


A PAY-OFF LIGHT 1 1S IN OFF POSITION 
OURING TRAINING OF GROUP Jj-y, | 

AND TEST OF ALL GROUPS 

PAY-OFF LIGHT 2 IS IN ON POSITION 4 
} 


BRIGHTNESS 
INTENSITY 
FOR S 


OURING TRAINING OF GROUPS x-y AND J-y. 
PAY-OFF LIGHT 2 IS IN OFF POSITION ° c 
DURING TRAINING OF GROUP x-i, OFF 
AND TEST OF ALL GROUPS. 


DELAY I 4 OELAY 
0.5 Sec + 


Ono ONO 


PUSH BUTTONS 


ADVANCE 
STEPPING FOR 
RESET SWITCH ons 


ON 


NEON OELAYS 
1.0 SEC 


NEON INDICATOR 

PROGRAM OF TIMER LIGHTS FOR E 

tne nn (LIGHTS 1 AND 2) FOR TRAINING FOR TEST 

(BRIG SERIES SERIES 


Fic. 3. Block diagram of the components of the apparatus. 


=f 
First brightness on in pro- 
| 
= 
j | OELAY | 
SEC 
FF 


dimension of 16 brightnesses, we frac- 
tionate that dimension into four parts of 
four brightnesses each and perform calcula- 
tions for each part separately, as well as 
examining the effect for all 16 brightnesses 
taken as a whole. Our reason for breaking 
the brightness dimension down into no more 
and no less than four parts, when examining 
trends across the dimension, is that four 
was the modal and median number of dif- 
ferent brightnesses which our Ss discrimi- 
nated in the course of brightness presenta- 
tions in the two random series. After 
finishing the test with brightnesses and be- 
fore starting the test without brightnesses, 
each S was asked how many different 
brightnesses he thought had been used—and 
the mode and median reply they gave was 
four. It hence seemed most psychologically 
meaningful to take this estimate as the 
number of segments into which to break 
the brightness dimension for analysis. 
Turning now to the findings in Tables 
2-5: For any measure, the results for the 
three groups of the experiment were first 
treated by the Kruskal-Wallis ranked anal- 
ysis of variance (Siegel, 1956). If the prob- 


| 
| | | 
\\ 


BRIGHTNESS 


NTENSITY 


SERIES BUTTONS 


TEST SERIES — 
BUTTONS 


GROUP X-Y 


Fic. 4. 
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The apparatus as seen by S 


ability of the distributions being from a 
common population was more than .10, the 
finding was considered not significant. If 
this probability was more than .05 but less 
than or equal to .10, the probability level 
was entered since it might be suggestive; 
but since the standard convention for sta- 
tistical significance was not met, no further 
test could be made for the measure in ques- 
tion. Finally, if the probability level was 
less than or equal to .05, we proceeded to 
determine whether this over-all significance 
of the ranked analysis of variance was due 
to the difference in the distributions of 
Groups XX vs. XY, XY vs. YY, or both." 
For comparisons of two groups, the Mann- 
Whitney U test (Siegel, 1956) was em- 
ployed. When the measure involves Label 1 
or Label 2, our hypothesis specifically pre- 


™ Such a procedure is analogous to following up 
the finding that a parametric analysis of variance 
is significant by using Tukey’s test for a significant 
gap to determine which intergroup differences cause 
the significant over-all effect 
warranted to do tests 
groups only if the 
groups is significant 


One is, of course, 
concerning a pair of the 
over-all effect for the three 


| | BRIGHTNESS 
{ | 7 PAY 
pay -OF F HT 
LIGHTS 
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dicts the direction of the difference betweeu 
Groups XX and XY, or between Groups 
XY and YY. Hence, when a Mann-Whitney 
test concerns Label 1 or Label 2, a one-tail 
test was made. When, on the other hand, 
the measure concerns refraining, ‘“‘equal”’ 
judgments, or the indifference point, we 
have alternative hypotheses which could pre- 
dict either direction of a difference, and 
hence a two-tail test was made.*® 

We offer the reminder that our hypoth- 
eses are expected to hold most strongly con- 
cerning the more ambiguous half of the 
brightness dimension—namely, Brightnesses 
9-16—since this is the area at the boundary 
of the category defined by Label 1. Our 
findings indicate that such, by and large, is 
the case: The hypotheses noted earlier are 
usually confirmed, in whole or in part, for 
Segments 9-12 and 13-16 of the brightness 
dimension, and are usually not confirmed 
for Portions 1-4 and 5-8. It hence requires 
a quite strong effect over Brightnesses 9-16 


®’ Whitney's extension of the U test to three 
groups (Mosteller & Bush, 1954) might seem 
appropriate for Tables 2-5, since this test permits 
one to evaluate the hypothesis that the distributions 
of three groups fall in a particular order. It could 
not be used, however, because there are large num 
bers of ties for scores among our three groups on 
a measure, and application of this technique when 
there are large numbers of intergroup ties involves 
theoretical difficulties 


DURING THE Last 32 


Group XX 
23) 


Group XY Group YY 
UN 3 


Label Label R Label R I abel 


{-4, 8.0 0.0 0.0 0.0 
median 

5-8, 5.0 0.0 1.0 7.0 1.0 0.0 5.0 1.0 0 
median 

9-12 4.0 1.0 |2.0 5.0 2.0 11.0 5 3.0 0 
median 

13-16 3.0 3.1 12.0 1.0 6.0 1.0 0.0 6.0 0 


median 


1-16, 
median 


1-16, 
mean 


MEDIAN FREQUENCY OF CHOICE OF LABELS | AND 2 AND THE 
TRIALS OF THE 


WALLACH 


for our hypotheses to be confirmed when 
the whole dimension—Brightnesses 1-16— 
is considered as a unit: yet such confirma- 
tion often occurs to at least some extent.® 


Label Learning (Choice Measure, 
Training Series) 


Tests of the frequencies of choice of 
Label 1, Label 2, and of the refraining but- 
ton during the last 32 trials of the training 
series, are presented in Table 2. From the 
choice frequencies for Label 1, we note that 
Hypothesis 1b, predicting XY > YY, is 
supported at the .07 level for Brightnesses 
13-16, and Hypothesis 2b, predicting XX > 
XY, is supported at the .01 level for those 
brightnesses. Neither hypothesis is sup- 
ported for other parts of the brightness 
dimension. The choice frequencies for 
Label 2 indicate that Hypothesis la, pre- 
dicting XX < XY, is supported at the .09 
level for Brightnesses 9-12, and at the .001 
level for Brightnesses 13-16. Hypothesis 
2a, predicting XY < YY, is supported at 
the .06 level for 9-12, but not for 13-16. 
The over-all effect for Brightnesses 1-16 
supports Hypothesis la at the .01 level, and 


® The measure of central tendency used is the 
median. Occasionally, however, significant effects 
occur which are not reflected in the medians for 
the groups in question, and in such cases means 
have been added. 


FABLE 2 


REFRAIN BuTTON (R 
[RAINING SERIES 


Mann-Whitney LU’ Tests 


Kruskal-Wallis 
Analysis ot 
Variance t 


Rank 


Between Groups 
XX and XY 


Between Groups 


XY and YY 


o1 001 <.02) <.01 <.001 <.03 <.07 N.S 


:| 

Lat rbel be bel bel | Lab 
Oo NS N.S. N.S 
N.S NS NUS 
NS os NS <.09 <.06 
0 
19.0 5.0 |7.0) 20.0 9.0 2.0) 15.5 11.0 Ox < .O8 03 


supports Hypothesis 2a at the .03 level. 
Regarding the frequencies of refraining, we 
note that XX > XY at the .03 level for 
Brightnesses 13-16, a finding that supports 
Hypothesis la and infirms Hypothesis 2. 
The findings of no significant difference for 
comparisons of other groups over Bright- 
nesses 9-12 and 13-16 may be supportive, 
in general, of all hypotheses in question, or 
may, on the other hand, simply be due to 
nonconfirmation of these hypotheses. Inter- 
pretation of failure to find a significant dif- 
ference between two groups in frequencies 
of refraining must always remain ambig- 
uous in this way. 


DURING THE 


Group XX 
V = 23 


Brightness 


Label 1 


Label 2 


1-4, median 
5-8, median 


5-8, mean 


9-12, 


13-16, median 


median 


1-16, mean 1; 


Kruskal-Wallis Analysis 
Brightness of Variance by Ranks 


abel 1 


5-8 N.S. <.01 N.S. 
9-1? < .001 < .001 
13-16 < 001 < 001 < 001 
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rABLE 3 


MEDIAN FREQUENCY OF CHOICE OF LABELS 1 AND 2 AND THE REFRAIN BUTTON (R 
Test With BRIGHTNESSES 


16, median 160.0 0.0 15.0 16 


between Groups Between Groups 


Label 2 x Lab 


Perceptually Based Indication of Labeling 
Tendencies (Test With Brightnesses) 
Table 3 concerns the frequencies of 

choice of Labe! 1, Label 2, and of the 

“Equal” button during the test with bright- 

nesses. With regard to choice of Label 1, 

we find Hypothesis 2b, that XX > XY, is 

supported at the .001 level for Brightnesses 

13-16, but Hypothesis 1b is not supported 

for those brightnesses, and neither hypoth- 

esis is supported for other brightnesses. 

For choice of Label 2, we find that Hypoth- 

esis la, predicting that XX < XY, is sup- 

ported at the .001 level for Brightnesses 

9-12 and 13-16, while Hypothesis 2a, pre- 


(;sroup Group YY 
V 23 \ 24 


Mann-Whitney U Tests 


XX and XY XY and YY 


1 Label 2 x Label 1 Label 2 Rk 


<.01 
001 < 001 <.01 N.S 
OO} < OO] < .02 < 06 


R Label 1 Label 2 Ik Label 1) Label 2 k 
0.0 0.0 0.0 0.0 0.0 0.0 
6.0 0.0 2.0 6.0 0.0 » 0 4.5 1.0 2.0 
0.1 0.7 1.0 
| 2.0 0.0 5.0 ».0 3.0 3.0 1.0 1.0 
; | 1.0 0.0 6.0 0.0 1.0 1.0 0.0 8.0 0.0 
LUG 11.0 0 12.5 12.0 7.0 
14.4 13.2 
| 
OO] 
1-16 <.Q2 < 001 < .09 < 001 < 09 <.01 N.S 
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dicting that XY < YY, is supported at the 
.O1 level for 9-12, and at the .02 level for 
13-16. Hypothesis la is also supported for 
Brightnesses 5-8 at the .01 level, and both 
hypotheses are supported for the brightness 
dimension as a whole—Hypothesis la at 
the .001 level, Hypothesis 2a at the .01 level. 
Concerning “equal” judgments, Hypothesis 
la, that XX > XY, is supported at the .001 
level for both 9-12 and 13-16, and this 
same finding infirms Hypothesis 2b. Also, 
XY > YY at the .06 level for Brightnesses 
13-16, a result that supports Hypothesis 2a 
and infirms Hypothesis 1b. The lack of 
significant difference between XY and YY 
for 9-12 may support both Hypotheses 1) 
and 2a, or simply reflect their nonconfirma- 
tion in this instance. Hypothesis la is also 
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supported by the finding that XX > XY 
for the whole brightness dimension, at the 
001 level. 


Conceptually Based Indication of Labeling 

Tendencies (Test Without Brightnesses) 

It is of interest to compare the indi- 
vidual’s judgments of the past relationship 
between perceived brightnesses and class 
labels—the test just described above—with 
his judgments of the past relationship be- 
tween points along the brightness dimension 
as he conceptualizes it, and those labels. 
The test without brightnesses provides an 
index of the latter, and our results indicate 
a high degree of match between an S's 
judgments on the two tests—and hence be- 
tween his implicit (more perceptual) and 


TABLE 4 
MEDIAN AMOUNT OF GRAPH AREA ASSIGNED TO LABELS | AND 2, AND TO 50% JUDGMENTS 
DuRING THE TEsT WitTHOUT BRIGHTNESSES* 


Group XX 
(N = 23) 
Brightness 


Label 1 | Label 


370.0 
262.0 
172.0 

60.0 


13 16, me 


1-16 1096 100.0 


Kruskal-Wallis Analysis 


Brightne of Variance by Ranks 


Label 1 Label 2 


N.S. 
< .03 
<.001 
< .001 


.05 
.03 


N.S. 
.001 N.S. 


.001 01 < 


<.001 | <.001 


<.07 


“> of a single brightness’ relative frequency 


Label 1 


Label 1 


Group YY 


Group XY 
( (N = 24 


N = 23) 


Label 2 Label 1 Label 2 


30. 
138. 
228. 


340. 


867.0 


Mann-Whitney U Tests 


Between ¢ 7roups 


XX and X\ 


Between Groups 


XY and YY 


Label 2 


Label 1 | Label 2 50°; 


.02 
001 N.S. 


001 .001 < 001 < .001 


1-4 350.0 50.0 0.0 0 0.0 344.5 sa. 0.0 
5-8 273.0 127.0 0.0 0 0.0 230.0 170.0 0.0 
9-12 224.0 176.0 0.0 0 0.0 145.0 255.0 0.0 
13-16 217.0 183.0 0.0 i 0.0 55.0 345.0 0.0 
$4.3 8.7 6.3 
733.0 50.0 756.5 843.5 50.0 
50°; 
| 
1—4 < 
5-8 < N 
q 9-12 < 
13-16 N.S 
1-16 
of occurren 


explicit (more conceptual) models as to 
what went on during the training series. 
Table 4 presents the findings concerning the 
test without brightnesses: For Label 1, we 
find that XX > XY at the .001 level for 
Brightnesses 9-12 and 13-16, and also at 
the .001 level for the brightness dimension 
as a whole. Since results for Labels 1 and 
2 are interdependent in the test without 
brightnesses, the above finding is also re- 
flected, for Label 2, in the result that XX 
< XY at the .001 level for 9-12, 13-16, and 
the whole brightness dimension. These find 
ings may support Hypotheses la, 2), or both. 
Analogously, that XY > YY at the .02 level 
for Label 1, Brightnesses 9-12, is reflected 
in XY < YY at the .02 level for 
Label 2, over the same brightness range 

results which may support Hypotheses 1), 
2a, or both. (This effect is also present at 
the .001 level over the whole brightness 
range.) Amounts of graph area for XY 
and YY are not different, however, for 
Label 1 or 2 over Brightnesses 13-16, which 
is contrary to Hypotheses 1b and 2a. Re 
garding the amount of graph area assigned 
to “equal” judgments, that XX XY at 
the .01 level supports Hypothesis la and 


also 


( srOUpS 
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infirms Hypothesis 2b. The absence of other 
significant differences for 9-12 and 13-16 
may support all the hypotheses in question, 
or simply be due to their nonconfirmation 
in these instances. 


Indifference Points 

Concerning the relative positions of the 
median indifference points, that XX > XY 
at the .01 level for the test with brightnesses 
and for the test without brightnesses (Table 
5) confirms Hypothesis la and infirms Hy- 
pothesis 2) in both cases; and that XY > 
YY at the .01 level for the test without 
brightnesses confirms Hypothesis 2a and 
infirms Hypothesis 1b. The nonsignificant 
differences between XY and YY for the 
test with brightnesses, and between XX and 
XY, and XY and YY for the training 
series, again may support all the relevant 
hypotheses or be an effect of their non 
confirmation. 


Reaction Times, Training Series, and 
Test Without Brightnesses 


If the reaction time measure is a valid 
index of judgment confidence, then latencies 
for predictions of Label 1 should drop as 


BRIGHTNESS L)IMENSION®* 


Mann-Whitney U’ Tests 


— Kruskal-Wallis 


Part of 
Experiment 


Last 32 trials of 
training series, 
median 


lest with 11.0 8.5 8.5 
brightnesses, (N = 23) | (N = 21 (N = 21) 
median 


lest without 21.3 8.5 8.5 
brightnesses, 
median 


Fest without 
brightnesses, (N = 22) | (N= 
mean 


* Each individual score is the median brightness 


Variance by 


Analysis of 
Between Groups | Between Groups 


XX and XY XY and YY 


Ranks 


< .001 


<.01 


<.01 <.01 <.01 


50% judgment category by each S 


= 
XX XY YY 
10.0 11.5 9.0 < 
(N = 21) | (N =17) | (N = 14) 
0 8.0 
23 (N = 24) 
assigned to thc 


14 


one moves from Brightness 8 to Bright- 
ness 1, since the association of brightness 
with Label 1 becomes more and more prob- 
able for all groups.'® Further, considering 
predictions of Label 2 for Brightnesses 9— 
16, Hypothesis la predicts a sharper drop 
in latency (i.e., a sharper degree of slope), 
as one moves from 9 to 16, for Group XY 
than for Group XX; and Hypothesis 2a 
predicts a sharper drop in latency over this 
brightness range for Group YY than for 
Group XY. We wished to measure, then, 
the degree of slope in Label 1 reaction times 
from Brightnesses 5-8 to Brightnesses 1-4, 
and in Label 2 reaction times from Bright- 
nesses 9-12 to 13-16. As the subject score 
for use in analysis of these drops in latency, 
a measure was needed which would be free 
of individual differences in absolute level of 
reaction time, since the absolute difference 
between Label 1 latencies for 5-8 and 1-4, 
or between Label 2 latencies for 9-12 and 
13-16, would be less for Ss whose reaction 
times are shorter in general. This would 
about “floor effect’: 
There will be less variability in Label 1 
latencies for 14 than for 5-8, and less vari 
ability in Label 2 latencies for 13-16 than 
for 9-12. As a result, faster Label 1 re 
sponders for 5-8 will show less of a slope 


come because of a 


to 1-4 than the slower responders simply 
because the fast people’s Label 1 responses 
to 1-4 cannot become much quicker than 
their responses to 5-8, while those of the 
slow people can. The same holds with re 


gard to Label 2 responders for 9-12 and 
13-16. 
To correct for this floor effect, it hence 


Was necessary to make the slope score a 
relative one—a ratio of the amount of 
Label 1 latency drop from 5-8 to 1-4 
divided by the total possible amount of 
drop; and similarly, a ratio of the amount 
of Label 2 latency drop from 9-12 to 13-16 
divided by the total possible amount of 
drop 


from Bright 
are difficult to make since the greatest 
concentration of reaction times for Label 1 is ex 
pected to be from Brightnesses 1-8, approximate] 
predictions 
1, 


Slope predictions for Label 1 
nesses 9-16 


would concern Hypotheses 


MICHAEL A. 


WALLACH 


With regard to the statistical tests used 
for reaction time data: The first matter of 
interest was to determine whether the de- 
gree of downward slope in reaction time 
found for each group—slope for Label 1 
responses from 5-8 to 1-4, or slope for 
Label 2 responses from 9-12 to 13-16- 
was significant or not. In the ¢ase of Label 
| responses, our prediction is that there will 
be a significant downward slope in reaction 
time from Brightnesses 5-8 to Brightnesses 
14, for all groups. With regard to Label 2 
responses, we expect that there will not be a 
significant downward slope from 9-12 to 
13-16 for Group XX, but that there will 
be a moderately significant downward slope 
for Group XY, and an even more signifi- 
cant slope of this kind for Group YY. In 
the cases where we predict a difference be- 
tween latencies for one group of bright 
nesses and another, then, our hypothesis 
predicts that this slope will take a particu- 
lar direction: downward from 5-8 to 1-4 
for Label 1 responses, and downward from 
9-12 to 13-16 for Label 2 responses. Since 
we are thus asking whether the slope is 
significantly different from zero in a certain 
direction, a one-tail test of the latency drops 
is in order. Wilcoxon matched-pairs signed- 
ranks tests (Siegel, 1956) were used for 
this purpose, and their results are entered 
in Table 6 

Concerning the drop in latency for Label 
1 responses from 5-8 to 1-4, we find the 
degree of this slope to be significant for all 
three groups during the last 32 trials of the 
training series, and also significant for all 
groups during the test with brightnesses. 
The levels met for various 
groups here range from .05 to .01. We also 
have no special reason to expect the degrees 
of slope to differ among the three groups ; 
for all conditions, confidence should be 
greater in applying Label 1 to Brightnesses 
1-4 than to Brightnesses 5-8, since associa 
tion of brightnesses with Label 1 is far from 
certain for 5-8, and almost certain for 14 
lo determine whether there were in fact no 
slope differences between groups, a Kruskal 
Wallis analysis of variance by 


significance 


ranks was 


: 


Group XX 
Part of 
Experiment 
Label 1 Label 2 


Last 32 trials of 
training series 


Fest with 
brightnesses 


Part ot 
Experiment 
For Group \N 


Label 1 Label 1 


Label 2 


Last 32 trials of 


N.S < .025 
training series 


brightnesses 


median RT of his Label 2 responses to Brightnesses 9-12. 


performed comparing the three groups dur- 
ing the last 32 training trials, and another 
was done comparing these groups during 


the test with brightnesses. As expected, re- 
sults for neither met conventional signifi- 
cance levels." 

Turning now to the latency drop for 
Label 2 responses from 9-12 to 13-16, we 


1 That results for the test with brightnesses are 
at the .09 level is probably due to chance, since no 
comparable trend occurred for the last 32 trials of 
the training series. We would expect a genuine 
trend to be reflected in both training and test 
series, since the trend for Label 2 on the test with 
brightnesses is, as we shall see later, quite con- 
sistent with the Label 2 trend for the last 32 train 
ing series trials 
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TABLE 6 
MEDIAN SLOPE OF REACTION TIME FOR LABEL 1 RESPONSES FROM MIDDLE OF BRIGHTNESS 


DIMENSION TO BRIGHTEST EXTREME, AND FOR LABEL 2 RESPONSES FROM MIDDLE OF 
BRIGHTNESS DIMENSION TO DARKEST EXTREME* 


Label 1 Label 2 Label 1 Label 2 


Wilcoxon Matched-Pairs Signed-Ranks Tests Kruskal-Wallis 


For Group XY 


Label 2 


Test with < .04 NS <.01 <.01 <.01 < 01 < 09 = 05° 


* Each individual Label 1 score is the median RT of S’s Label 1 responses to Brightnesses 5-8, minus median RT of his Label 1 
responses to Brightnesses 1-4, over median RT of his Label 1 responses to Brightnesses 5-8. Each individual Label 2 score is the 
median RT of S's Label 2 responses to Brightnesses 9-12, minus median RT of his Label 2 responses to Brightnesses 13-16, ove 


> P value for the hypothesis that the three medians lie in this order. 
¢ P value for the hypothesis that group YY's median is greater than the others. 


Group X\ oup YY 


Analysis of 
Variance by 


Group YY Ranks 


abel 1 Label 2 


Label 2 


find that for the last 32 training series trials 
this slope is not significant for the XX 
group, is at the .10 level for the XY group, 
and is highly significant (.01 level) for 
Group YY. For the test with brightnesses, 
the slope again is not significant for Group 
XX, but meets the .01 level for both XY 
and YY. These trends across groups are in 
keeping with our hypothesis, but the further 
task remains of determining whether indeed 
the slope for YY exceeds that for XY, and 
that for XY exceeds the XX group's slope, 
as predicted. Since we are predicting that 
the distributions of the slope scores will fall 
in a specific order for the three groups, a 
test by Whitney's extension of the U sta- 


07 03 Os 09 07 19 
(N = 23 N = 15 N = 23 N 22 V 24 (N = 23) : 
: 16 25 27 20 29 40 
(N = 23) NV = 4) (N = 22 V = 20 \ 23 (N = 24 
; Label 1 
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tistic to three groups is called for.'* Follow- 
ing this procedure, our hypothesis that the 
slope for Group YY exceeds that for XY, 
while XY’s slope exceeds XX’s slope, is 
significant at the .06 level for the last 32 
training series trials. On the test with 
brightnesses, the number of Ss in Group XX 
providing reaction to at least one brightness 
in the range 9-12 and also to at least one in 
the range 13-16, was so small that the hy- 
pothesis of a specific order of degrees of 
slope for the three groups could not be 
But another version of 
Whitney's extension of the Ll’ technique 
could be applied—-one testing the related 
hypothesis that the slope for Group YY is 
greater than that for the other two groups. 
This prediction was confirmed at the .05 
level. On the whole, then, the reaction time 
data support our expectations quite well." 


tested as such. 


'? This test can be used in the present case, 
whereas it couldn't be applied to Tables 2-5, be- 
cause the number of intergroup ties is minimal 
here. In dealing with those few ties that did occur, 
we followed Jonckheere’s (1954) recommendation, 
which is the most conservative policy 
namely, assigning the ties in a manner that favors 
the null hypothesis and is unfavorable to the hy- 
pothesis of interest to us Specifically, ties are ex- 
cluded from the sum when a large sum is favorable 
to our hypothesis, and ties are included in the sum 
when a small sum is favorable to our hypothesis 


interest to consider Cartwright's 
(194la, 1941b) work on decision time in the light 
if our reaction time findings for Label 2 responses 
Cartwright suggested the generalization that deci- 
ion time decreases as one moves away from the 
boundary and toward the category 
ilong some stimulus dimension. Our findings with 
regard to Label 2 responses suggest, however, that 
this is not invariably the case, but rather depends, 
n ambiguous situations, on whether the category 
in question is defined positively or negatively, and 
whether its contrast class is defined positively or 
is the complementary negative. For when the class 
of interest 1 negatively with a positive 
contrast no decrease in 

time as the the class approached 
from its boundary; when that class was positively 
defined with a positive contrast class, we found 
some decrease; and when that class was positively 
defined andthe 
omplementary, we 


yssible 


is of 


center of a 


was define: 
found decision 
act 


class, we 


center of was 


contrast class was its negative 


found the greatest decrease of 
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DISCUSSION 


Our Hypotheses in the Light of the Results 

Our hypotheses concerning acceptance 
limits for placing events in a class may be 
grouped in two ways. Some concern defi- 
nition of a class of interest by inclusion or 
exclusion (namely, Hypotheses la and 1b), 
while others concern definition of a contrast 
class by inclusion or exclusion (Hypotheses 
2a and 2)). One may also note that some 
the above differences when the 
class of interest is ambiguous (Hypotheses 
la and 2a), while others involve cases where 
the class of interest is partly certain (Hy 
potheses 1b and 2b). We find from Table 7 
that Hypotheses la, 2a, and 2b tend to be 
confirmed more strongly than 1b. The ap- 
propriate conclusions would therefore seem 
to be these: Whether the class of interest is 
defined positively or negatively, and whether 
the contrast class is defined positively or by 


concern 


exclusion of the class of interest, both in- 
fluence the range of acceptance limits for 
placing events in the class of interest, but 
the effect of the latter factor is somewhat 
greater than that of the former. Further, 
these effects are somewhat stronger when 
the class ef interest is fully ambiguous than 
when it is partially certain, although present 
to a considerable extent even in the latter 


ase 


TABLE 7 


EVALUATION OF THE FouR HYPOTHESES IN TERMS 
OF SOME OF THE RESULTS FOR BRIGHTNESSES 9-16" 


Hypotheses 


onfirmations 


Cyn. 
C/N, P < .05 


want 


*» We have arbitrarily assumed that each chance considered 
r confirmation of an hypothesis should be given equal weight 

ther, and show, for each hypothesis, the number of 
>) divided by the total number of possi- 
ion (N). Results for refraining, ‘‘equal"’ 
ference points are not considered here 
juality between two groups on those 
two hypotheses or, alternatively, may 

In the first row, we define confirma- 
ymparisons yielding P's of .10 or better 
or better 


row, .O5 


3 

la 1d 2a 2b 
/8 1/6 4/8 4/6 

‘ 
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Why do these effects occur? Let us con- 
sider each of our two basic hypotheses in 
turn. 


On Definition by Inclusion and Exclusion 


A reminder is in order concerning the 
nature of our experimental conditions: The 
implications as to positively and negatively 
defined classes for our groups were pro- 
vided by allusion, rather than by differences 
in logical structure of the classes. This was 
done so that the three groups would be 
equated with regard to the kinds and direct- 
ness of information they were obtaining, 
and would differ only in whether a class 
seemed to be defined in its own right, or 
rather by exclusion of some other class. If 
the Ss had been dealing differently with 
these two kinds of classes in their prior 
everyday experience (for whatever reason), 
we would thus expect such differences to 
transfer to the experimental situation, even 
though there was no advantage in the ex- 
periment proper from treating the two kinds 
of classes differently. Had there been such 
an advantage in the present experiment, we 
would never know if differences found be- 
tween groups were due solely and specifi- 
cally to the nature of this advantage or 
rather were due to the overwhelming influ- 
ence of class definition by inclusion vs. ex- 
clusion as such. Various factors active prior 
to the time of the experiment could, in turn, 
cause a preference for positive classes—but 
our basic question was: Does a preference 
for inclusion-defined classes exist that is 
strong enough to operate even when it does 
not give S an advantage in the present ex 
perimental situation ? 

The results of our research indeed sug 
gest that people have a prejudice of this 
kind in favor of classes defined by inclu- 
sion. There are several factors that may 
underlie this preference. 

Disjunctiveness of an exclusion class. To 
assign an event to an exclusion class is 
virtually a declaration of ignorance as to 
how to deal with it, since the number of 
possible alternative actions excluded by such 
assignment is usually much smaller than the 
number of alternatives that remain. If 
something is not a giraffe, for example, it 


may be a lion, or a man, or a dandelion, or 
any one of a myriad of remaining possi- 
bilities. The appropriate modes of conduct 
toward it if it is lion, man, or anything else, 
are quite different from one another, and 
we have no basis for choosing one over 
another. This is the point which seems to 
underlie Bartlett’s (1957) discussion when 
he notes that transfer of past learning con- 
cerning a class is possible when we are pre- 
sented with-new positive exemplars of that 
class, but not when faced with new negative 
exemplars of it. A negative instance of a 
class simply informs you that behavior ap- 
propriate to members of that class is not 
appropriate here, but it does not tell you 
what constitutes suitable behavior in the 
present The same matter is adum- 
brated by Hovland and Weiss (1953) when 
they make the following comment: “The 
positive instances define a coherent, well 
structured combination of characteristics of 


case. 


the concept, while the negative instances are 
in effect positive instances of what the con- 
cept ‘is not.’ They thus lack an organized, 
unitary quality, except for the limiting situ 
ation where only 
volved” (p. 181). 


Our study, of course, concerns just this 


a single dimension is in- 


limiting situation, since only one dimension 

brightness—was used. Although a class 
that we defined by exclusion was thus just 
as organized and unitary as one we defined 
by inclusion, our Ss dealt with exclusion 
classes as they might if their definition were 
disorganized, disjunctive. Such an effect 
may well be due to transfer from the many 
cases a person has encountered where ex 
clusion classes’ definitions indeed are dis 
junctive. 

Necessity for transforming information 
when working with an exclusion class. Even 


if there is actually only one possible alterna- 
tive class to which an event can belong 
when one knows that it is not an instance 


of some other class, the classifier now must 
usually engage in the added mental opera- 
tion of assigning the event to that alterna- 
tive class. (‘‘This is not-X: therefore it is 
such-and-such.”’) There is thus one more 
thing Ke must do than would be the case if 
he had been able to place the event in its 
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appropriate class at the outset. That indi- 
viduals dislike this extra operation is evi- 
dent from the preference for “direct test” 
found by Bruner, Goodnow, and Austin 
(1956) in their Ss. 

The extent of this preference is well illus- 
trated by comparing particular groups from 
a study by Bruner, Wallach, and Galanter 
(in press), and work by Goodnow and 
Pettigrew (1956). In both cases, S’s task 
was to learn that the correct pattern of 
pressing a left- and a right-hand button 
was left-left-right repeated. In the Bruner, 
Wallach, and Galanter experiment S was 
informed, on any trial, which of the buttons 
was correct: If the left button were correct 
on a particular trial, a left-hand light would 
go on after S had pushed either button; 
while if the right button were correct on 
that trial, a right-hand light would appear 
after S had pressed either button. In Good- 
now and Pettigrew’s study, on the other 
hand, S was forced to transform informa- 
tion concerning which button was not cor 
rect, on trials in which S erred, into knowl- 
edge of what button was correct for that 
trial: If S responded correctly on any trial, 
a poker chip fell out of a central chute; 
while if he made an error on a trial, no chip 
appeared. When he pushed the wrong 
button on a trial, then, S had to reason that 
this trial was not appropriately dealt with 
by that button (this trial in the sequence 
was not an instance of the class defined by 
that button), but rather should be handled 
by the button on the other side. In contrast, 
when the S in Bruner, Wallach, and Galan- 
ter’s experiment pushed the wrong button, 
he immediately knew which button was ap- 
propriate for that trial (i.e., he knew to 
which class—which button—this trial in the 
sequence should be assigned), since a light 
went on over the correct button. Positive 
classes thus were always utilized in the 
Bruner, Wallach, and Galanter study, since 
S always knew what button was appropriate 
on that trial. Goodnow and Pettigrew, on 


the other hand, provided their Ss with com- 
plementary negative classes when they made 
a mistake—the Fs told them directly only 
what button was mot appropriate on that 
That Goodnow and Pettigrew’s Ss 


trial. 


took about twice as long to learn the left- 
left-right pattern as Bruner, Wallach, and 
Galanter’s Ss, indicates how strong an aver- 
sion people have against transforming 
knowledge of what something isn’t into 
knowledge of what it is. 

In our experiment, once again, such trans- 
formation is not required of S when dealing 
with an exclusion class, since its negative 
nature is alluded to in the label and nothing 
more. Placing brightnesses in the exclusion 
class is no less direct than placing them in 
the inclusion class. Nevertheless, our Ss 
avoided exclusion classes as they might if 
there were added mental strain involved in 
their use. Such an effect again may be the 
result of transfer from the many cases a 
person has encountered where use of exclu- 
sion classes indeed does involve an extra 
step of reasoning. 

Difference in kind of response. Presenta- 
tion of an instance of an inclusion class 
requires one to make a particular suitable 
response of some sort, while presentation 
of an exemplar of an exclusion class re- 
quires one to refrain from making, or with- 
hold, the response that would be appropriate 
for an instance of the excluded class. Now 
it may well be that the tendency to make a 
response is stronger, for the same amount 
of training, than the tendency to withhold 
that that is, given a_ certain 
amount of practice in making or withhold- 
ing a response, the former tendency may be 
stronger than the latter. This would lead to 
a greater inclination to use inclusion classes 


response ; 


rather than exclusion classes simply because 
of the different kinds of responses involved 
in each 

In our research, again, such a response 
difference is not obviously involved, since 
there is no difference in the formal arrange 
ments of button-pushing responses for any 
of the three conditions: i.e., the XX and 
YY groups, as well as the XY group, all 
press one button to indicate placement of a 
brightness in Class 1, and all press an oppo 
site button to indicate placement of a bright- 
ness in Class 2. The S indicates that he is 
using an exclusion class by as active a re- 
sponse as he makes to indicate use of an 
inclusion class. Further, one cannot argue 


that, whereas the verbal labels “X” and “Y” 
are of equal strength, the label “Yes” is 
stronger than “No.” For our best index of 
the strength of “Yes” and “No,” the Thorn- 
dike-Lorge (1944) frequency count, indi- 
cates that they are both very strong—and, 
if anything, the frequency of “No” exceeds 
that of “Yes.” Although no obvious differ- 
ences in kind of response for inclusion vs. 
exclusion classes were involved in our ex- 
periment, then, the Ss reacted as they might 
if such response differences did occur. This 
once more may be due to transfer from 
cases where use of inclusion vs. exclusion 
classes did involve the sort of response dif- 
ferences described. 

Innate preference for presence rather 
than absence of stimulation. We have just 
noted three reasons why a person might 
well be expected to learn, during his life- 
time, to avoid exclusion classes. Apart from 
such learning, however, there may also be 
some innate basis for this preference of in- 
clusion classes. Several theories of motiva- 
tion—notably those of Freud (1955), Hull 
(1943), and Miller and Dollard (1941) 
have conceived of drives as strong stimuli, 
and reduction of such stimulation intensity 
as a preferred state which organisms will 
strive to achieve. Recently, however, it has 
been noted that many situations exist in 
which organisms prefer stimulation in 
creases rather than decreases. For example, 
Harlow, Harlow, and Meyer (1950) report 
that monkeys solve mechanical puzzles for 
the sheer love of doing it; Butler (1953) 
found that monkeys will solve problems for 
the “reward” of being able to look out of 
their cages; Kish (1955) reports that mere 
onset of illumination will function to 
strengthen a response; and evidence from 
the McGill group (Bexton, Heron, & Scott, 
1954) indicates that mental life rapidly 
deteriorates in a human temporarily de- 
prived of stimulus input through the main 
modalities. Thus, the presence of certain 
kinds of stimulation may be innately pre- 
ferred to their absence. If there were a 
tendency of this sort at the perceptual level, 
it could well be expected to underlie a pref- 
erence for positive rather than negative 
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classes at the conceptual level, since a posi- 
tive class involves the possible presence of 
its defining properties, while a negative 
class involves the absence of those prop- 
erties and the others in 
particular. 


presence of no 


Several suggestions have been made, then, 
as to why people prefer positive to negative 
classes. Any or all of them may actually 
apply. 


On the Relevance of a Contrast Class to a 
Class of Interest 


Our experiment has found that whether a 
contrast class is defined positively or by 
exclusion of the class of interest influences 
the acceptance limits for that latter class. 
The particular effects exercised by positive 
vs. negative definition of the contrast class 
can well be expected from the considera- 
tions outlined above, if one assumes that a 
contrast class will have any influence at all 
on the class of interest. But it is this latter 
question that concerns us here. Why should 
a contrast influence the 
limits for a class of interest 
place ? 


class acceptance 


in the first 


Perhaps the answer is somewhat as fol- 
lows. We have noted elsewhere ( Wallach, 
1958) how necessary it is for the organism 
to select some attributes as a basis for estab- 
lishing equivalence classes and to ignore 
others, in order to avoid being overwhelmed 
by the confusing richness of properties that 
the environment offers the senses. Attention 
to a contrast class would seem to be one 
important mechanism for making such selec- 
tions, since choosing one or another con- 
trast class tends to influence the number and 
kinds of properties in whose terms we define 
the class of interest. Further evidence on 
this point is contained in a recent study by 
Wallach and Caron (in press). 

\lthough, once again, consideration of the 
contrast class does not have such an effect 
in the present experiment since only one 
dimension 


brightness—was used, the dem 


onstrated influence of the contrast class may 


be due to transfer from the many cases a 
person has encountered where contrast does 
serve the function discussed. 


SUMMARY AND CONCLUSIONS 


This study was concerned with some of 
the factors arising from the nature of the 
classification process itself that may influ- 
ence our range of acceptance limits for 
placing events in a category. When it is 
uncertain whether some event should be 
placed in a particular class or not—whether 
this person is a communist, whether that 
student’s grade should be a C, whether there 
was a speck of light on the telescope’s 
screen—what factors, arising from the way 
classes are defined, sway our judgment? 
Two such factors were examined: (a) 
Placing data in a class, we proposed, does 
not go on in a vacuum, but rather requires 
that one consider some contrasting class 
whose nature is used as a basis for deciding 
how one will specify the class of interest. 
In particular, we expected people’s readi- 
ness to assign data to the latter class to be 
greater when the contrast class was simply 
defined by exclusion as whatever remained 
outside the class of interest, than when de 
fined by inclusion as being such-and-such. 
(b) Further, we noted that the class of 
interest itself may be defined by inclusion 
or exclusion—that is, in terms of its being 
such-and-such, or in terms of its not being 
some other class—and we suggested that 
people would be more likely to place events 
in the class of interest when it was defined 
in the former way 


Seventy university students were distrib 
uted among groups designed to test our two 
hypotheses under each of two conditions of 
ambiguity of class. It was found that both 
hypetheses were confirmed : that is, whether 


MICHAEL A 


WALLACH 


the class of interest is defined by inclusion 
or exclusion, and whether the contrast class 
is defined by inclusion or by exclusion of 
the class of interest, both influence the range 
of acceptance limits for placing 
the class of interest. However, 
of the latter factor may be 

greater than that of the former. 
these effects were found to be somewhat 
stronger when the class of interest was 
highly ambiguous than when it was partially 
certain, although present to a considerable 
extent even in the latter case. 

Is it significant that these hypotheses were 
confirmed? Perhaps the best way to answer 
is to consider their origin. It occurred to us 
that many matters which psychologists of 
cognition investigate involve the organism's 
detecting similarities among events 


events in 
the effect 
somewhat 

Further, 


involve 
its acting on the assumption that a present 
event is like something else that happened 
before. We thought further that such find 
ing of similarities might best be conceptual 
ized as a classification process. If so, judg 
ments as to the range of equivalence—the 
range of events considered to be similar to 
one another and hence in a common class 

ought to depend on the way the assumed 
classes are defined. Our hypotheses, in turn, 
concerned 
class 


particular differences in 
definitions 


such 
which were 
in the experiment but 
which should, on our theory, have large 
effects. Confirmation of these hypotheses 
was hence by no means a foregone conclu 
sion, and suggested it may be valuable to 
explore further implications of the idea that 
much of cognitive activitv involves classi 
fication processes. 


differences 


objectively slight 
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